Accurate in silico identification of species-specific acetylation sites by integrating protein sequence-derived and functional features
Authors
Abstract
Lysine acetylation is a reversible post-translational modification that plays an important role in cytokine signaling, transcriptional regulation, and apoptosis. To fully understand acetylation mechanisms, identifying substrates and their specific acetylation sites is crucial. Experimental identification is often time-consuming and expensive. Bioinformatics methods offer a cost-effective alternative and can be used in a high-throughput manner to generate relatively precise predictions. Here we develop a method termed SSPKA for species-specific lysine acetylation prediction, using random forest classifiers that combine sequence-derived and functional features with two-step feature selection. Feature importance analysis indicates that functional features, applied to lysine acetylation site prediction for the first time, significantly improve predictive performance. We apply the SSPKA model to screen the entire human proteome and identify many high-confidence putative substrates that have not been previously identified. The results, along with the implemented Java tool, serve as useful resources to elucidate the mechanism of lysine acetylation and facilitate hypothesis-driven experimental design and validation.
Similar resources
Functional Assessment of CODM Gene in Different Cultivar of Papaveraceous Species Via In Silico Analysis
Medicinal use of the opium poppy (Papaver somniferum L.) has an ancient history, but the isolation of morphine was not described until the early nineteenth century. Morphine has been the most important alkaloid of the opium poppy over the last 50 years. In the pathway reported to generate morphine in this species, CODM has a crucial role as the gene coding the enzyme respons...
In silico investigation of lactoferrin protein characterizations for the prediction of anti-microbial properties
Lactoferrin (Lf) is an iron-binding multi-functional glycoprotein which has numerous physiological functions such as iron transportation, anti-microbial activity and immune response. In this study, different in silico approaches were exploited to investigate Lf protein properties in a number of mammalian species. Results showed that the iron-binding site, DNA and RNA-binding sites, signal pepti...
Improved Species-Specific Lysine Acetylation Site Prediction Based on a Large Variety of Features Set
Lysine acetylation is a major post-translational modification. It plays a vital role in numerous essential biological processes, such as gene expression and metabolism, and is related to some human diseases. To fully understand the regulatory mechanism of acetylation, identification of acetylation sites is first and most important. However, experimental identification of protein acetylation sit...
Phylogenetic and In Silico Analysis of Interferon Beta-1b Protein
Background and purpose: Interferon beta-1b recombinant protein is used for reducing the relapse rate and treatment in patients with Multiple sclerosis (MS). In this study, phylogenetic and in silico analysis of interferon beta-1b were conducted by servers and bioinformatics tools to predict its structural potential. Materials and methods: Physiological and physico-chemical characteristics of...
Journal title:
Volume 4, Issue
Pages -
Publication date: 2014